Zizhao Tong

SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models

May 28, 2026
Via arXiv

Incantation: Natural Language as the Action Interface for Multi-Entity Video World Models

May 18, 2026
Via arXiv

Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension

Feb 10, 2026
Via arXiv

Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model

Jan 20, 2026
Via arXiv